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Abstract 

Background: Identification of true to breed type animal for conservation purpose is imperative. Breed dilution is 
one of the major problems in sustainability except cases of commercial crossbreeding under controlled condition. 
Breed descriptor has been developed to identify breed but such descriptors cover only "pure breed" or true to the 
breed type animals excluding undefined or admixture population. Moreover, in case of semen, ova, embryo and 
breed product, the breed cannot be identified due to lack of visible phenotypic descriptors. Advent of molecular 
markers like microsatellite and SNP have revolutionized breed identification from even small biological tissue or 
germplasm. Microsatellite DNA marker based breed assignments has been reported in various domestic animals. 
Such methods have limitations viz. non availability of allele data in public domain, thus each time all reference 
breed has to be genotyped which is neither logical nor economical. Even if such data is available but 
computational methods needs expertise of data analysis and interpretation. 

Results: We found Bayesian Networks as best classifier with highest accuracy of 98.7% using 51850 reference allele 
data generated by 25 microsatellite loci on 22 goat breed population of India. The F ST values in the study were 
seen to be low ranging from 0.051 to 0.297 and overall genetic differentiation of 13.8%, suggesting more number 
of loci needed for higher accuracy. We report here world's first model webserver for breed identification using 
microsatellite DNA markers freely accessible at http://cabin.iasri.res.in/gomi/. 

Conclusion: Higher number of loci is required due to less differentiable population and large number of breeds 
taken in this study. This server will reduce the cost with computational ease. This methodology can be a model for 
various other domestic animal species as a valuable tool for conservation and breed improvement programmes. 
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Background 

Breed of a given species are known to emerge over years 
during evolution within a specific ecological niche. Each 
breed is a unique combination of gene in a given gene 
pool and over the period of time with selection for 
survival as well as also for productivity due to human 
intervention. Except cases of commercial crossbreeding 
under controlled condition, the breed dilution is one of 
the major problems in sustainability of the breed. The 
identification of true to breed type animal for conservation 
purpose is imperative. If we conserve crossbred or admix- 
tured breed, its long term sustenance is compromised as 
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breed is not well adapted over period of time to its native 
ecological niche. Cross breeding of native goats with 
exotic breeds of goats (Alpine, Saanen and Boer) has 
shown poor reproductive performance and high mortality 
rate in higher grade crosses thus selective breeding of true 
to the breed type animals is desirable with maintained di- 
versity level for successful conservation and long term 
sustainability of breed [1]. Such identification tool is also 
needed to establish breed products origin in todays global 
market [2]. 

Though breed descriptor has been developed in India 
to identify breed but such descriptors cover only "pure 
breed" type animals which excludes more than 2/3rd of 
population which are either undefined or admixture 
[3-5]. In case of close resemblance of phenotype it be- 
comes subjective to identify the breed. Moreover, when 
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degree of admixture is not so conspicuously visible then 
it is hard to differentiate between true to breed type and 
"admixtured breed". Advent of molecular tools like 
microsatellite and SNP have revolutionized the breed 
identification even from small samples of biological tis- 
sue or germplasm without having ova and semen. In 
case of semen, ova or embryo the breed cannot be iden- 
tified as there are no visible breed descriptors. 

Microsatellite DNA marker based breed identification 
has been reported in various domestic animals like cattle 
[6,7], sheep [8,9], goat [10,11], pig [12], horse [13], dog 
[14] poultry and rabbit [15]. Such methods have limi- 
tations namely, non-availability of allele data in public 
domain, thus each time all reference breed has to be ge- 
notyped which is neither logical nor economical. Even if 
such data is available but computational methods needs 
expertise of data analysis and interpretation. 

The present work aims at development of a model 
web server for breed identification where one need not 
to do genotyping of all referral breeds each time increas- 
ing the cost of molecular level identification. In order to 
achieve this, we have used 51850 allelic data of microsa- 
tellite marker obtained from DNA fingerprinting of 22 
goat breeds on 25 loci across India. This methodology 
demonstrates that it can be used as model for other do- 
mestic animal species and breed for identification and 
conservation for long term sustainability endeavor. 

Implementation 

Genomic DNA isolation and creation of data set 

Blood samples were collected from a total of 1037 unrela- 
ted animals belonging to twenty two different Indian goat 
breeds. The breeds selected were from diverse geogra- 
phical regions and climatic conditions with varying utili- 
ties and body sizes. Genomic DNA was isolated from the 
blood samples by using SDS-Proteinase-K method [16,17]. 

The quality and quantity of the DNA extracted was 
assessed by Nanodrop 1000 (Thermo Scientific, USA) 
before further use. A total of 51850 allelic data generated 
by 25 microsatellite (details can be seen at http://cabin. 
iasri.res.in/gomi/algorithm.html) loci based DNA fin- 
gerprinting on 22 goat breeds i.e. Blackbengal, Ganjam, 
Gohilwari, Jharkhand black, Attapaddy, Changthangi, 
Kutchi, Mehsana, Sirohi, Malabari, Jamunapari, Jhakarana, 
Surti, Gaddi, Marwari, Barbari, Beetal, Kanniadu, Sangam- 
nari, Osmanabadi, Zalawari and Cheghu across India were 
collected. In India, there are 23 registered breeds though 
FAO reports 32 which are due to vernacular name, geo- 
graphical name and synonymous name with language 
diversity. 

Microsatellite DNA markers selection 

We followed ISAG (International Society for Animal 
Genetics) guidelines in marker selection such as (i) at 



least one marker from each chromosome, (ii) if selected 
markers are on same chromosome, then must be on dif- 
ferent arm of the chromosome, (iii) if still they are in the 
same arm then distance must be of 50 cM to ensure in- 
dependent segregation through recombination and (iv) 
PIC (Polymorphism Information Content) value must be 
more than 0.5 to ensure higher information of markers 
in a given population. The data generated using 25 loci 
viz. ILST008, ILSTS059, ETH225, ILSTS044, ILSTS002, 
OarFCB304, OarFCB48, OarHH64, OarJMP29, ILSTS005, 
ILSTS019, OMHC1, ILSTS087, ILSTS30, ILSTS34, 
ILSTS033, ILSTS049, ILSTS065, ILSTS058, ILSTS029, 
RM088, ILSTS022, OarAE129, ILSTS082 and RM4 
(Table 1) was used as standard breed reference at the 
back end of server [17]. 

Data Generation by allele detection and genotyping 

PCR products were mixed in ratio of 1:1.5:2:2 of FAM 
(blue), VIC (green), NED (yellow) and PET (red) labelled 
respectively after determining the optimal pooling ratio 
and dilution ratio for a set of primers. In order to ensure 
size calibration of alleles 0.5 \iL of this mixture was com- 
bined with 0.3 \iL of Liz 500 as internal lane standard 
(Applied Biosystems) and 9.20 \iL of Hi-Di Formamide 
per sample. The resulting mixture was denatured by in- 
cubation for 5 min at 95°C to run on automated DNA 
sequencer of Applied Biosystems (ABI 3100 Avant). The 
electropherograms were drawn through Gene Scan and 
used to extract DNA fragment sizing details using Gene 
Mapper software (version 3.0) (Applied Biosystems). Ge- 
nerated data is numeric in terms of base pair which is 
size of each allele along with genotype (combination of 
allele at every diploid locus). The protocol has been 
described at http://cabin.iasri.res.in/gomi/tutorial.html. 
The obtained allelic data were further analysed using 
FSTAT software (http://www2.unil.ch/popgen/softwares/ 
fstat.htm) to compute relative locus differentiation of each 
breed in the entire dataset. 

Bayesian networks as classifiers 

Classification is a technique to identify class labels for 
instances based on a set of features (attributes). Building 
accurate classifiers from pre-classified data is a very ac- 
tive research topic of machine learning and data mining. 
In last two decades, many classification algorithms have 
been proposed including Naive-Bayes, Neural Network 
(Multilayer Perceptron), Random Forest and Bayesian 
Network based classifiers. 

Naive-Bayes, an effective classifier is easy to construct 
as the structure is given a priori i.e., no structure learn- 
ing procedure is required. It assumes that features are 
independent of each other. Although this assumption is 
not realistic, Naive-Bayes has surprisingly outperformed 
many sophisticated classifiers over a large number of 
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Table 1 List of 25 loci along with the primer pairs 


Locus 


Forward primer 


Reverse primer 


Dye 


Size range 


No. of observed allele 


ILST008 


gaatcatggattttctgggg 


tagcagtgagtgaggttggc 


FAM 


167-195 


12 


ILSTS059 


gctgaacaatgtgatatgttcagg 


gggacaatactgtcttagatgctgc 


FAM 


105-135 


14 


ETH225 


gatcaccttgccactatttcct 


acatgacagccaagctgctact 


VIC 


146-160 


9 


ILST044 


agtcacccaaaagtaactgg 


acatgttgtattccaagtgc 


NED 


145-177 


16 


ILSTS002 


tctatacacatgtgctgtgc 


cttaggggtgtattccaagtgc 


VIC 


113-135 


14 


OarFCB304 


ccctaggagctttcaataaagaatcgg 


cgctgctgtcaactgggtcaggg 


FAM 


119-169 


31 


OarFCB48 


gagttagtacaaggatgacaagaggcac 


gactctagaggatcgcaaagaaccag 


VIC 


149-181 


21 


OarHH64 


cgttccctcactatggaaagttatatatgc 


cactctattgtaagaatttgaatgagagc 


PET 


120-138 


10 


OarJMP29 


gtatacacgtggacaccgctttgtac 


gaagtggcaagattcagaggggaag 


NED 


120-140 


14 


ILSTS005 


ggaagcaatgaaatctatagcc 


tgttctgtgagtttgtaagc 


VIC 


174-190 


9 


ILSTS019 


aagggacctcatgtagaagc 


acttttggaccctgtagtgc 


FAM 


142-162 


11 


0MHC1 


atctggtgggctacagtccatg 


gcaatgctttctaaattctgaggaa 


NED 


1 79-209 


27 


ILSTS087 


agcagacatgatgactcagc 


ctg cctcttttcttg a g ag c 


NED 


142-164 


11 


ILSTS30 


ctgcagttctgcatatgtgg 


cttagacaacaggggtttgg 


FAM 


159-179 


12 


ILSTS34 


aagggtctaagtccactggc 


gacctggtttagcagagagc 


VIC 


153-185 


15 


ILSTS033 


tattagagtggctcagtgcc 


atgcagacagttttagaggg 


PET 


151-187 


25 


ILSTS049 


caattttcttgtctctcccc 


gctgaatcttgtcaaacagg 


NED 


160-184 


13 


ILSTS065 


gctgcaaagagttgaacacc 


aactattacaggaggctccc 


PET 


105-135 


16 


ILSTS058 


gccttactaccatttccagc 


catcctg a ctttg g ctg tg g 


PET 


136-188 


27 


ILSTS029 


tgttttgatggaacacagcc 


tggatttagaccagggttgg 


PET 


148-191 


23 


RM088 


gatcctcttctgggaaaaagagac 


cctgttgaagtgaaccttcagaa 


FAM 


109-147 


19 


ILSTS022 


agtctgaaggcctgagaacc 


cttacagtccttggggttgc 


PET 


186-202 


9 


OARE129 


aatccagtgtgtgaaagactaatccag 


gtagatcaagatatagaatatttttcaacacc 


FAM 


130-175 


23 


ILSTS082 


ttcgttcctcatagtgctgg 


agaggattacaccaatcacc 


PET 


100-136 


19 


RM4 


cagcaaaatatcagcaaacct 


ccacctgggaaggccttta 


NED 


104-127 


12 



datasets, especially where the features are not strongly 
correlated [18]. Bayesian Network (BN) is a kind of un- 
restricted classifier. A common feature of Naive Bayes is 
that the class node is treated as a special node: the par- 
ent of all the features. However, BN treats the class 
nodes as an ordinary node, it is not necessary a parent 
of all the feature nodes. The learning methods and the 
performance of BN for classification are well described 
by Friedman et al. in 1999 [19]. It has powerful pro- 
babilistic representation for classification. A Bayesian 
network B which is a graphical model that encodes a 
probability distribution P B (Ai,A 2 , ...,A W C) from a given 
training set. The resulting model can be used so that, 
given a set of attributes a 1} a 2 , a m the classifier based 
on B returns the label/class c which maximizes the pos- 
terior probability, i.e. 

P B (c\ai,a 2 , ...,a n ) 

Let D = {vL lt u 2 , u n } denotes the training data set. 
Here, each is a tuple of the form (a[,a l 2 , ...,a l n ,c l ) 



which assigns values to the attributes A lf A 2) ...,A n and 
to the class variable C. The log likelihood function, 
which measures the quality of learned model, can be 
written as 

LL(B\D) = YZi lo &*VWv4>---A) 

The first term in above equation measures efficiency 
of network B to estimate the probability of a class given 
set of attribute values. The second term measures how 
well network B estimates the joint distribution of the at- 
tributes. Since the classification is determined based on 
P B {C\Ai,A 2) ...,A n ), only the first term is related to the 
score of the network as a classifier i.e., its predictive ac- 
curacy. This term is dominated by the second term, 
when there are many observations. As n grows larger, 
the probability of each particular assignment to A lt 
A 2) ...,A n becomes smaller, since the number of possible 
assignments grows exponentially in n. In our study, 
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Table 2 Performance of different classifiers 



Method 


Sensitivity 


Specificity 


Accuracy 


MCC 


FDR 


Bayes NET 


0.858 


0.993 


0.987 


0.851 


0.142 


Naive Bayes 


0.404 


0.972 


0.946 


0.376 


0.596 


Multilayer- 
Perceptron 


0.450 


0.974 


0.950 


0.424 


0.550 


Random Forest 


0.682 


0.985 


0.971 


0.667 


0.318 



The best performing classifier is represented in bold. 



number of feature (n) are the number of alleles (two 
alleles per locus) i.e. 50 and the total number of samples is 
1037 which includes 22 breeds (classes). Prediction per- 
formance of a Bayesian network has also been compared 
with Multilayer Perceptron [20] and Random forest 
algorithm [21]. 

In this study, WEKA machine learning workbench 
with extensive collection of machine learning algorithms 



and data pre-processing methods was used for classifica- 
tion and prediction [22] . 

Assessment of the prediction accuracy 

The best model was selected using various statistical 
measures viz, sensitivity, specificity, precision or positive 
predictive value (PPV), negative predictive value (NPV), 
accuracy, false discovery rate (FDR) and Mathews cor- 
relation coefficient (MCC). Accuracy estimate was ob- 
tained using five-fold cross-validation technique [23]. 
For five-fold cross validation technique, the total ob- 
servations were divided into five parts. Training was 
done with four sets of observations and testing with 
one set. The same was repeated such that each set 
got the opportunity to fall under the test set. Accu- 
racy for each was recorded and the averages of all 
these five accuracies were reported. The measures are 
defined as follows: 
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ACTUAL GOAT BREEDS 



Bb-Blackbengal; G-Ganjam; Gw-Gohilwari; Jb-Jharkhandblack; At-Attapaddy; Ch-Changthangi; K-Kutchi; M-Mehsana; 
Si-Sirohi; Mb-Malabari; Jp-Jamunapari; J-Jhakarana; Su-Surti; G-Gaddi; Mw-Marwari; B-Barbari; Be-Beetal; Kn- 
Kanniadu; Sn-Sangamnari; Ob-Osmanabadi; Zw-Zalawari; C-Cheghu 



Figure 1 Confusion matrix to show prediction power of BayesNet for each goat breed. 
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Sensitivity or TP Rate = TP /{TP + FN) Specificity = TN/{FP + TN) 
PPV = TP I (TP + FP) NPV = TN/(TN + FN) 

( TP + TN) 

Accuracy = — — - FDR = FP/(FP + TP) 

J (TP + FP + TN + FN) /v 7 

MCC _ (TP* TN-FP* FN) 



^{TP + FP)(TP + FN)(TN + FP)(TN + FN) 



where TP = True Positive, TN = True Negative, FP = 
False Positive, FN = False Negative. 

Web implementation 

The server is developed using CGI-Perl script, Hyper 
Text Markup Language (HTML) and Java Scripts to 
make it more user-friendly and launched using open 
source web server software program, Apache. Other 
models like Random Forest, Multiple Layer Perceptron 
were logically excluded in web implementation ensuring 
objectivity of identification accuracy. The user needs to 
submit the microsatellite allelic data having numeric values 
in base pairs at http://cabin.iasri.res.in/gomi/gomi.html. 
The data can also be uploaded either using xsv or .txt for- 
mat or direct entry in the submission form. The server 
has tutorial for the users for easy understanding with a 
sample data at http://cabin.iasri.res.in/gomi/tutorial.html. 

Results and discussion 

In order to evaluate the performance of Bayesian Net- 
work classifier with respect to other popular classifiers 
such as Naive Bayes, Multilayer Perceptron and Random 
Forest, were trained and tested using five-fold cross 
validation and prediction performance measures were 
averaged over five test sets. These classifiers were ap- 
plied over the 51850 allelic/microsatellite data of Indian 
goat breeds and it has been observed that Bayes Net- 
work outperformed other methods (viz. Naive Bayes, 
Multilayer Perceptron and Random Forest method) with 
sensitivity (TP Rate), specificity, PPV, NPV, accuracy and 



MCC values as 0.858, 0.993, 0.860, 0.993, 0.987 and 
0.851. The performance of these classifiers is shown in 
Table 2. Confusion matrix to show prediction power of 
Bayesian Network for each goat breed is represented in 
Figure 1. Graphical representation of various evaluation 
measures (sensitivity or TP Rate, accuracy and ROC 
area) over all the 22 breeds of goat gives clear picture of 
the result obtained (Figure 2). The area under ROC 
(total area equals 1) represents the quality of classifica- 
tion. Higher the value better is the classification which is 
also evident from our result. 

Similar case of microsatellite data based breed identi- 
fication using Bayesian method has been found with 
much higher accuracy for example 99.63% accuracy in 
five Spanish sheep breed viz. Churra, Latxa, Castellana, 
Rasa-Aragonesa and Merino using 18 microsatellite 
markers [4]. Similar works have been reported in cattle 
[24], camel [25] and dog [26]. 

The novel approach and methodology developed in 
this study gives higher accuracy which is in similar range 
of earlier studies in cattle [27]. In some reported cases 
number of loci needed for breed identification ranged 
much lower like 3-10 [26,28]. For our study, all the 25 
loci were needed which is due to poor differentiation of 
loci in the breeds. Populations having higher F ST values 
always needed minimum loci. Contrary to this, popula- 
tion having low F ST needs more number of loci and still 
the accuracy is compromised. For example, Murciana 
and Granadina populations with 25 microsatellites of 
low F ST value (0.0432) have been reported with just 80% 
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Figure 2 Graphical representation of various evaluation measures over all the 22 breeds of goat. Bb-Blackbengal; G-Ganjam; 
Gw-Gohilwari; Jb-Jharkhandblack; At-Attapaddy; Ch-Changthangi; K-Kutchi; M-Mehsana; Si-Sirohi; Mb-Malabari; Jp-Jamunapari; J-Jhakarana; Su-Surti; 
G-Gaddi; Mw-Marwari; B-Barbari; Be-Beetal; Kn-Kanniadu; Sn-Sangamnari; Ob-Osmanabadi; Zw-Zalawari; C-Cheghu. 
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Figure 3 Graph of F ST values of each locus. 




accuracy [29]. Contrary to this, in case of horse, where 
F ST was having a range of 0.2 to 0.259, the accuracy has 
been high up to 95%, even with minimum of 3 loci [28]. 

In case of very low F ST like 0.009, the breed identifi- 
cation accuracy has been reported as low as 39-48% in 
four breeds. The poor success in correct breed assign- 
ment is due to weak genetic differentiation and gene 
flow between populations [29]. In our study, the F ST 
values were calculated and were seen to be low ranging 
from 0.051 at 5th locus to 0.297 at 10th locus and 
overall genetic differentiation of 13.8%, suggesting more 
number of loci needed for higher accuracy and we found 
the expected result in our study (Figure 3). In our obser- 
vation when loci number was increased this low F ST was 
compensated for identification accuracy. The relation- 
ship between locus differentiation (F ST ) and accuracy of 
prediction is proportionate. If F ST value in a given popu- 
lation of locus selected are higher (> 0.10) then number 
of locus needed is relatively less. If F ST value of loci in a 
given population is low (<0.05) then more number of 
loci is required to achieve accuracy [26] . 

Poor F ST in Indian goat population is already repor- 
ted in many studies related to goat breeds of India 
[16,30,31]. This is happening due to unplanned and in- 
discriminate mating prevalent in breeding region leads 
to small effective population size or mating between rel- 
atives and consequent genetic drift. The general practice 
of breeding here is to allow few bucks for the whole 
village/flock [30]. For conservation, proper breeding 
strategies must be designed by rotating the bucks in 
their flock since the male:female sex ratio is too low. We 
found Jhakrana, Marwari and Sirohi having lower sensi- 
tivity and MCC (Table 3) which is due to overlapping 
native breeds of tract where mixing of population pre- 
vails in Western India. The low MCC of Jamunapari and 
Marwari population are obviously expected as lot of 



Table 3 Prediction accuracies obtained on twenty two 
breeds of goat 



Breed 




Bayes network 






Sensitivity 


Specificity 


Accuracy (*) 


MCC 


FDR 


Blackbengal 


0.958 


0.998 


0.996 (0.005) 


0.956 


0.042 


Ganjam 


0.958 


0.997 


0.995 (0.005) 


0.946 


0.061 


Gohilwari 


0.958 


0.998 


0.996 (0.005) 


0.956 


0.042 


Jharkhandblack 


0.833 


0.994 


0.986 (0.006) 


0.844 


0.130 


Attapaddy 


0.854 


0.997 


0.990 (0.006) 


0.887 


0.068 


Changthangi 


0.979 


0.998 


0.997 (0.003) 


0.968 


0.041 


Kutchi 


0.978 


0.999 


0.998 (0.005) 


0.977 


0.022 


Mehsana 


0.854 


0.996 


0.989 (0.006) 


0.877 


0.089 


Sirohi 


1.000 


1.000 


1 .000 (0.000) 


1.000 


0.000 


Malabari 


0.917 


0.993 


0.989 (0.006) 


0.884 


0.137 


Jamunapari 


0.458 


0.980 


0.956 (0.003) 


0.467 


0.476 


Jhakarana 


0.625 


0.990 


0.973 (0.011) 


0.671 


0.250 


Surti 


0.750 


0.995 


0.984 (0.008) 


0.803 


0.122 


Gaddi 


0.917 


0.986 


0.983 (0.005) 


0.825 


0.241 


Marwari 


0.667 


0.976 


0.961 (0.021) 


0.597 


0.429 


Barbari 


0.729 


0.995 


0.983 (0.009) 


0.790 


0.125 


Beetal 


0.792 


0.991 


0.982 (0.007) 


0.790 


0.191 


Kanniadu 


0.979 


0.986 


0.986 (0.011) 


0.862 


0.230 


Sangamnari 


0.938 


0.999 


0.996 (0.002) 


0.956 


0.022 


Osmanabadi 


0.979 


0.996 


0.995 (0.006) 


0.947 


0.078 


Zalawari 


1.000 


1.000 


1 .000 (0.000) 


1.000 


0.000 


Cheghu 


0.791 


0.989 


0.981 (0.017) 


0.763 


0.244 


Weighted Avg. 


0.858 


0.993 


0.987 


0.851 


0.142 



*The values in parenthesis are the respective standard deviations computer 
from 5-fold cross validation. 

Data in bold represent the weighted average, where weights are the sample 
sizes for each breed. 
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allele are getting introduced through immigrant goat 
breeds in the respective population [30,31]. 

Conclusion 

Through the present study, we are reporting first web 
server for breed prediction with accuracy of more than 
98% using 22 goat breeds of India. The number of loci 
needed is relatively high due to less differentiable popu- 
lation and large number of breeds taken in this study. 
The web server can be used for other domestic species 
thus relevant for global use. Further studies are war- 
ranted to look for new algorithm to reduce the number 
of loci in prevailing conditions of large number of breeds 
and with lower differentiation especially prevailing in 
"breed melting pot" regions like India and other major 
diversity regions of the world. This server will reduce 
the cost with computational ease. This methodology 
would become a model for all flora and fauna for var- 
iety and breed identification required in improvement, 
conservation, sovereignty issues in trans-border germ- 
plasm movement and management. 

Availability and requirements 

Webserver can be accessed freely at http://cabin.iasri.res. 
in/gomi/ 

Server Name: http://cabin.iasri.res.in/ 
Project home page: http://cabin.iasri.res.in/gomi/ 
Operating system(s): e.g. Platform independent 
Programming language: PERL, Java, PHP 
Other requirements: Internet connectivity 
License: No restrictions on non-commercial/Research use 
Any restrictions to use by non-academics: Non- 
academicians may contact corresponding author 
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